Outline: Applications of Neural Nets Nettalk -learning Pronunciation of English Text Classifying Sonar Targets 16.1 Nettalk 16.1.1 Overview Phoneme String Text Speech Figure 16.1: a Text-to-speech System Using Nettalk
نویسنده
چکیده
NETtalk is a classic example of a back-propagation trained multi-layer perceptron network applied to a practical application. NETtalk, created by Sejnowski and Rosen-berg 1], applies a multi-layer network to the text-to-speech problem. The goal is to develop a system which can convert English text into its underlying sequence of phonemes and stress markers. The string of phonemes and stress markers can then be used by a speech synthesizer to generate an audio realization of the text as seen in Figure 16.1. In using a network approach to the problem, it was hoped that NETtalk could learn a general mapping of spelling to pronunciation. Other current text-to-speech products such as DECtalk, utilize a dictionary lookup for common and irregular English words and apply a set of phonological rules to convert words which don't appear in the NETtalk DECtalk
منابع مشابه
Achieving High-Accuracy Text-to-Speech with Machine Learning
In 1987, Sejnowski and Rosenberg developed their famous NETtalk system for English text-to-speech. This chapter describes a machine learning approach to text-to-speech that builds upon and extends the initial NETtalk work. Among the many extensions to the NETtalk system were the following: a diierent learning algorithm, a wider input \window", error-correcting output coding, a right-to-left sca...
متن کاملParallel Networks that Learn to Pronounce English Text Terrence
This paper describes NETtalk, a class of massively-parallel network systems that learn to convert English text to speech. The memory representations for pronunciations are learned by practice and are shared among many processing units. The performance of NETtalk has some similarities with observed human performance. (i) The learning follows a power law. (;i) The more words the network learns, t...
متن کاملA Comparison of the Classic NetTalk Text-to-Speech System to a Modern, Distributed Representation and Simple Recurrent Network
This paper reports on a comparison to the well-known NetTalk implementation of Engl!sh text-to-speech translation via neural networks. A distributed representation scheme for encoding is investigated opposed to the classic localist representation scheme used in the original NetTalk. The paper discusses a modem re-implementation based on Elman’s Simple Recurrent Network.
متن کاملNettalk: a Parallel Network That Learns to Read Aloud
Unrestricted English text can be converted to speech by applying phonological rules and handling exceptions with a look-up table. However, this approach is highly labor intensive since each entry and rule must be hand-crafted. NETtalk is an alternative approach that is based on an automated learning procedure for a parallel network of deterministic processing units. ~ f t e r ' training on a co...
متن کاملParallel Networks that Learn to Pronounce English Text
This paper describes NETtalk, a class of massively-parallel network systems that learn to convert English text to speech. The memory representations for pronunciations are learned by practice and are shared among many processing units. The performance of NETtalk has some similarities with observed human performance. (i) The learning follows a power law. (ii) The more words the network learns, t...
متن کامل